REGULARITY PROPERTIES OF THE STERN ENUMERATION 

OF THE RATIONALS 



BRUCE REZNICK 



Abstract. The Stern sequence s(n) is defined by s(0) = 0, s(l) = 1, s(2n) = s(n), 
s(2n + 1) = s(n) + s(n + 1). Stern showed in 1858 that gcd(s(n), s(n + 1)) = 1, and 
that for every pair of relatively prime positive integers (a, b) there exists a unique 
n > 1 with s(n) = a and s(n + 1) = b. We show that in a strong sense, the average 
value of s (^"\) is §, and that for d > 2, (s(n),s(n + 1)) is uniformly distributed 
among all feasible pairs of congruence classes modulo d. More precise results are 
presented for d = 2 and 3. 



1. Introduction and History 

In 1858, M. A. Stern ^H] defined the diatomic array, an unjustly neglected math- 
ematical construction. It is a Pascal triangle with memory: each row is created by 
inserting the sums of pairs of consecutive elements into the previous row. 

a b 

a a + b b 

a 2a + b a + b a + 2b b 



(1.1) 



a 3a + b 2a + b 3a + 2b a + b 2a + 3b a + 2b a + 3b b 



When (a, b) = (0, 1), it is easy to see that each row of the diatomic array repeats 
as the first half of the next row down. The resulting infinite Stern sequence can also 
be defined recursively by: 

(1.2) s(0) = 0, 8(1) = 1, s{2n) = s(n), s{2n + 1) = s(n) + s{n + 1). 

Taking (a, b) = (1, 1) in (1.1), we obtain blocks of (s(n)) for 2 r < n < 2 r+1 . Although 
s(2 r ) = 1 is repeated at the ends, each pair (s(n),s(n + 1)) appears below exactly 
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once as a consecutive pair in a row: 

(r = 0) 11 
(r = 1) 12 1 

(1.3) (r = 2) 1 3 2 3 1 

(r = 3) 143525341 



Mirror symmetry (or an easy induction) implies that for < k < 2 r , we have 

(1.4) s(2 r + k) = s(2 r+1 -k). 
In his original paper, Stern proved that for all n, 

(1.5) gcd(s(n),s(n + 1)) = 1; 

moreover, for every pair of positive relatively prime integers (a,b), there is a unique 
n so that s(n) = a and s{n + 1) = b. Stern's discovery predates Cantor's proof of 
the countability of Q by fifteen years. This property of the Stern sequence has been 
recently made explicit and discussed in Another enumeration of the positive 
rationals involves the Stern-Brocot array, which also predates Cantor; see jS], pp. 
116-123, 305-306. This was used by Minkowski in defining his ?-function; see |14j . 
The Stern sequence and Stern-Brocot array make brief appearances in Dickson's 
History, see 0, pp. 156, 426. Apparently, de Rham [5j was the first to consider the 
sequence (s(n)) per se, attributing the term "Stern sequence" to Bachmann [2], p. 
143, who had only considered the array. The Stern sequence has recently arisen as 
well in the discussion of 2-regular sequences pQ and the Tower of Hanoi graph [TUj . 
Some other Stern identities and a large bibliography relating to the Stern sequence 
are given in TO . A further discussion of the Stern sequence will be found in [16]. 
Let 

/ \ s(n) 

(1.6) t(n) - 



s{n + 1) 

Here are blocks of (t(n)), for 2 r < n < 2 r+1 for small r: 

(r = 0) 
(r = l) 

(1.7) (r = 2) 



3) 



i 
i 

1 2 

2 1 

13 2 3 

3 2 3 1 
14352534 
43525341 
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In Section 3, we shall show that 

3N 



;i.8) = — + O(log 2 A0 



n=0 

so the "average" element in the Stern enumeration of Q + is |. 
For a fixed integer d > 2, let 

(1.9) Sd(n) := (s(n) mod d, s(n + 1) mod d) 
and let 

(1.10) Sd = {(i mod d,j mod d) : gcd(i, j, d) = 1}. 

It follows from (1.5) that Sd{n) G Sd for all n. In Section 4, we shall show that for 
each d, the sequence (Sd(n)) is uniformly distributed on Sd, so the "probability" that 
s(n) = i (mod d) can be explicitly computed. More precisely, let 

(1.11) T(N- d,i) = \{n:0<n<N k s(n) = i mod d}\ . 
Then there exists < 1 so that 

(1.12) T(N;d,i)=r d)i N + 0(N T *), 
where 

In particular, the probability that s(n) is a multiple of d is 1(d) -1 , where 

(1.14) 1(d) = dT\ e N. 

P I d F 

In Section 5, we present more specific information for the cases d = 2 and 3. It 
is an easy induction that s(n) is even if and only if n is a multiple of 3, so that 
r 2 = 0. We show that r 3 = | and give an explicit formula for T(2 r ; 3, 0), as well as a 
recursive description of those n for which 3 | s(n). We also prove that, for all iV > 1, 
T(iV;3,l)-T(iV;3,2)G{0,l,2,3}. 

It will be proved in ^H] that 

(1.15) T(2 r ;4,0) = T(2 r ;5,0), T(2 r ; 6, 0) = T(2 r ; 9, 0) = T(2 r ; 11, 0); 

we conjecture that T(2 r ; 22, 0) = T(2 r ; 27, 0). (The latter is true for r < 19.) These 
exhaust the possibilities for T(2 r ;Ni,0) = T(2 r ;iV 2 ,0) with JV< < 128. Note that 
J(4) = 7(5) = 6, 7(6) = 1(8) = 1(9) = 7(11) = 12 and 7(22) = 7(27) = 36. However, 
T(2 r ; 8, 0) 7^ T(2 r ; 6, 0), so there is more than just asymptotics at work. 
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2. Basic facts about the Stern sequence 

We formalize the definition of the diatomic array. Define Z(r, k) = Z(r, k; a, b) 
recursively for r > and < k < 2 r by: 

Z(0,0) = a, Z(0,l) = 6; 

^ ' ' Z(r + 1, 2k) = Z(r, k), Z(r + 1, 2k + 1) = Z(r, fc) + Z(r, k + 1). 

The following lemma follows from (1.2), (2.1) and a simple induction. 

Lemma 2.1. For < k < 2 r , we have 

(2.2) Z(r,fc;0,l) = s(fc). 

Lemma 2.1 leads directly to a general formula for the diatomic array. 
Theorem 2.2. For < k < 2 r , we have 

(2.3) Z(r, k; a, b) = s(2 r - k)a + s(k)b. 

Proof. Clearly, Z(r, k; a, b) is linear in (a, b) and it also satisfies a mirror symmetry 

(2.4) Z(r, k; a, b) = Z(r, 2 r - k; b, a) 
for < k < 2 r , c.f. (1.4). Thus, 

(2.5) Z(r, k; a, b) = aZ(r, k; 1, 0) + bZ(r, k; 0, 1) = aZ(r, 2 r - k; 0, 1) + bZ(r, k; 0, 1). 

The result then follows from Lemma 2.1. □ 

The diatomic array contains a self-similarity: any two consecutive entries in any 
row determine the corresponding portion of the succeeding rows. More precisely, we 
have a relation whose simple inductive proof is omitted, and which immediately leads 
to the iterated generalization of (1.2). 

Lemma 2.3. If0<k<2 r and < k < 2 r ° - I, then 

(2.6) Z(r + r , 2 r k + k; a, b) = Z(r, k; Z(r , k ; a, b), Z(r , k + 1; a, b)). 
Corollary 2.4. If n > and < k < 2 r , then 

(2.7) s(2 r n + k) = s(2 r - fc)s(n) + s(k)s(n + 1). 

Proof. Take (a, b, k , r ) = (0, 1, n, [log 2 (n+ 1)]) in Lemma 2.3, so that k < 2 r °, and 
then apply Theorem 2.2. □ 

We turn now to t(n). Clearly, t(2n) < 1 < t(2n + 1) for all n; after a little algebra, 
(1.2) implies 

(2.8) t(2n) = , t(2n+ 1) = 1 +t(n). 

The mirror symmetry (1.4) yields two other formulas which are evident in (1.7): 

(2.9) t(2 r + A;)t(2 r+1 -A;-l) = l, 
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, + 1 M _ • s '(-' ' - k - 1) _ s(2 r + k+l) 



for < k < 2 r — 1, which follows from 

(2.10) t(2 r+1 -k-l, 

K ' y s{2 r ^-k) s(2 r + k) t(2 r + k)' 

and 

(2.11) t{2 r + 21) + t{2 r+1 - 21 - 2) = 1, 
for r > 1 and < 21 < 2 r — 2, which follows from 

s(2 r + 2£) s(2 r+l -2£-2) s(2 r + 21) s(2 r + 2^ + 2) 

f^.l^) — — J — ~\~ — — ~~~~ ~"T~ ~ ~~ ~"T~ ~\~ 

V / _ / Or i O li i 1 \ _ / Or4- C\ f) 1 \ „ / Of i O / i 1 \ 



s(2 r + 2£ + l) s(2 r+1 -2^-l) s(2 r + 2£ + l) s(2 r + 2^+l)' 

since s(2m) + s(2m + 2) = s(2m + 1). 

Although we will not use it directly here, we mention a simple closed formula for 
t(n), and hence for s(n). Stern had already proved that if 2 r < n < 2 r+1 , then the 
sum of the denominators in the continued fraction representation of t(n) is r + 1; this 
is clear from (2.8). Lehmer jTTj gave an exact formulation, of which the following is 
a variation. Suppose n is odd and [n]2, the binary representation of n, consists of a 
block of ai l's, followed by a<i 0's, 03 l's, etc, ending with a 2v 0's and g^+i l's, with 
dj > 1. (That is, n = 2 ai+ - +a2 -+ l - 2 a2+ - +a2 -+ 1 ± • • • ± 2 a ^ +1 - 1.) Then 

/ \ s(n) p I 
(2.13) t{n) = ,\\. = - = a 2v+1 + 



n + 1J q 1 

a 2v H 



1 

••• + — 

a\ 

Conversely, if | > 1 and (2.13) gives its presentation as a simple continued fraction 
with an odd number of denominators, then the unique n with t(n) = 2 has the binary 
representation described above. (If n is even or - < 1, apply (2.9) first.) 

The Stern-Brocot array is named after the clockmaker Achille Brocot, who used it 
[3] in 1861 as the basis of a gear table; see also 0. This array caught the attention 
of several French number theorists, and is discussed in • It is formed by applying 
the diatomic rule to numerators and denominators simultaneously: 

(r = 0) 



1 

1 
1 1 



(^ = 1) , , n 

(2.14) (r = 2) S \ \ \ \ 

( r _ o\ 011213231 

v °) 132312110 



This array is not quite the same as (1.7). If | and | are consecutive in the r-th 
row, then they repeat in the (r + l)-st row, separated by It is easy to see that 

the elements of the r-th row are 70zp, , < k < 2 r . It is also easy to show that the 
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elements of each row are increasing, and moreover, that they share a property with 
the Farey sequence. 

Lemma 2.5. For < k < 2 r - 2, 

s(k + l) s(k) 1 



(y2 ' 15 ^ s{2 r -k-l) s(2 r -k) s(2 r - k)s{2 r - k- 1)' 

That is, 

(2.16) s(k + l)s{2 r - k) - s{k)s{2 r - k - 1) = 1. 

This lemma has a simple proof by induction, which can be found in ^2j, p. 467 and 

S p.n- 

The "new" entries in the (r + l)-st row of (2.14) are a permutation of the r-th row 
of (1.7). The easiest way to express the connection (see [16J) for rationals - > 1 is 
that if < k < 2 r is odd, then 



p_s(2 r + k)_ s(2 r + k) 



q s(2 r -k) s (2 r + k + l) J 

where *n denotes the integer so that [n] 2 and ] yn}2 are the reverse of each other. If 
| < 1, then apply mirror symmetry to the instance of (2.17) which holds for -. 

The Minkowski ?-function can be defined using the first half of the rows of (2.14). 
For odd t, < t < 2 r , 

(2.18) 7 ( , f 

This gives a strictly increasing map from Q D [0, 1] to the dyadic rationals in [0, 1], 
which extends to a continuous strictly increasing map from [0, 1] to itself, taking 
quadratic irrationals to non-dyadic rationals. 

Finally, suppose iV is a positive integer, written as 

(2.19) N = 2 ri +2 r2 + --- + 2 r \ n > r 2 > ••• > r v . 
We shall define 

(2.20) 7V = 0; Nj = 2 ri + ■ ■ ■ + 2 r > for j = 1, . . . , v. 
Further, for l<j<v, let Mj = 2~^N j+1 , so that 

(2.21) Nj = Nj-! + 2 r i = 2 r *(Mj + 1) = 2 ri ~ 1 M j - 1 . 
and, for a < b E Z, let 

(2.22) [a,b) := {k G Z : a < k < b}. 
Our proofs will rely on the observation that 

V—l V 

(2.23) [0,N) = \J[Nj,N j+1 ) = \J[2 r mj,2 r i(Mj + 1)), 
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where the above unions are disjoint, so that, formally, 

N-l v -l N j+1 -1 v 2 r J(M j +l)-l 

(2.24) y>yj E = E E ■ 

n=0 j=0 n=Nj j=l n =2 r iM j 

3. The Stern- Average Rational 
We begin by looking at the sum of t(n) along the rows of (1.7). Let 

2 r+1 -l 2 r -l r-1 

(3.1) A(r)= ^2 t(n) and A(r) = t(n) = A(i). 

n=2 r n=0 i=0 

Lemma 3.1. For r > 0, 

(3.2) A {r) = \-T- 1 - and A(r) = 3 - ■ T - ^ . 
Proof. First note that A(0) = t(l) — \ — § - \. Now observe that for r > 0, 

2 r + l_ 1 2 r -l 2 r -l 

(3.3) A(r + 1) = t (2 , ' +1 = f ( r+1 + 2k ) + Yl + 2k + 1 )- 

j=0 k=0 k=0 

Using (2.11) and (2.8), we can simplify this summation: 

2 r -i 1 /2 r -l \ 

(3.4) t ( r+1 + 2k ) = 2 (Yl ^ 2r+1 + 2A; ) + t{T+2 - 2A; - 2) J = 2 r ~\ 

k=0 \k=Q J 

and 

2 r -l 2 r -l 

(3.5) ^ t(2 r+l + 2A; + 1) = ^ (l + t(2 r + fc)) = 2 r + A(r). 

fc=0 fc=0 

Thus, A(r+1) = 2 r ~ 1 +2 r + A(r) , and the formula for A(r) is established by induction. 
This also immediately implies the formula for A(r). □ 

Lemma 3.2. If m is even, then 

2 r -l 

(3.6) A(r) <^t(2 r m + k) < A(r). 

k=0 

Proof For fixed (k,r), let 

(3 ' 7) ® kAx) - s(2r - (k + l))x + s(k + iy 

Then it follows from (2.16) that 

(38) & (x) _ s(k + l)*(2 r -k)-s(k)s(2'-k-l) 

(6.6) ^ r (x) - ^ _ (jfc - i))x - - 1))2 > U. 
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Using (2.7), we see that 

s(2 r m + k) s(2 r - k)s(m) + s(k)s(m + 1) 

1,(2 rn — k) = 



s(2 r m + k + l) s(2 r - k - l)s(m) + s(k + l)s(m+ 1) 

(3.9) 



/ s(m) \ ^ . . ., 

=Mi(^J = *'" Wm)) ' 



Since m is even, < t(m) < 1; monotonicity then implies that 

(3.10) i(Jfe) = $ fc , r (0) < t(2 r m + k) < $ r , fc (l) = t(2 r + k). 

Summing (3.10) on k from to 2 r — 1 gives (3.6). □ 

We use these estimates to establish (1.8). 
Theorem 3.3. If2 r <N < 2 r+1 , then 

3N r 2 + 7r + 6 ^ . , 3N 1 

(3.H) x -^E^Xy-j- 

?1=0 

Proof. Recalling (2.24), we apply Lemma 3.2 for each j, with r = 7j and m = Mj, so 
that 



(3-12) *.r>-*±*< ^ , W< 3.^_1 

n=Nj-i 

After summing on j, we find that 

jW_n + --- + r. + 3» < ^ Sjjf_« 
v ' 2 2 ~ ^ 2 2 

n II 

To obtain (3.11), note that £ + 3v < + 3r + 3 = r!+Zr+6. □ 

Corollary 3.4. 

3/V 

(3-14) ]>>(n) = — + 0(log 2 iV). 

n=0 

Since t(2 r — 1) = j, the true error term is at least OilogN). Numerical computa- 
tions using Mathematica suggest that log 2 N can be replaced by log N log log N. It 
also seems that, at least for small fixed positive integers t, 

N-l , s 

(3.15) a t := lim iV-^L 

v 1 n^oo N ^ s(n + 1) 

exists. We have seen that a± = §, and if they exist, the evidence suggests that 
a 2 ~ 1.262, a 3 1.643 and a 4 ps 1.161. We are unable to present an explanation for 
these specific numerical values. 
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4. Stern Pairs, mod d 

We fix d > 2 with prime factorization d — Y[pT> e t — 1> anc ^ reca U the definitions 
of S d and S d (n) from (1.9) and (1.10). Let 

(4.1) N d = \S d \ , 
and for < % < d, let 

(4.2) N d (i) = \{j mod d : (i mod d, j mod d) G S d } \ . 

We now give two lemmas whose proofs rely on the Chinese Remainder Theorem. 
Lemma 4.1. The map S d : N — > S d is surjective. 

Proof. Suppose a = G S d with < i,j < d — 1. We shall show that there exists 
w G N so that gcd(i,j + wd) = 1. Consequently, there exists n with s(n) = i and 
s(n + 1) = j + wd, so that Sd(n) = a. 

Write i = n^/", ft > 1, with prime. If <^ | j, then f d. There exists 
w > so that w = d^ 1 (mod g/ £ ) if qe \ j and w = (mod g/ £ ) if <^ f j. Then 
j + toe? ^ (mod q/) for all so no prime dividing i divides j + wd, as desired. □ 

Lemma 4.2. For < i < d — 1, 

(4.3) N d = d 2 \{ P ^ and N d {i) = d \{ 

i Pe Pt \i Vi 

Proof. To compute iV d , we use the Chinese Remainder Theorem by counting the 
choices for (i mod p e /,j modp^) for each i. Missing are those in which pn 

divides both i and j, and so the total number of classes is (p e / — p e /~ ) 2 for each t. 

Now fix i. If p£ | i, then G S d if and only if pi \ j; if pi \ i, then there is no 

restriction on j. Thus, there are either p^ e —p\ l ~ l or pf choices for j, respectively. □ 

Suppose a = G S d ; let L(a) := + j) and R(a) = (i + j,j), where i + j 
is reduced mod d if necessary. Then L(a),R(a) G S d and the following lemma is 
immediate. 

Lemma 4.3. For all n, we have S d (2n) = L(S d (n)) and S d (2n + 1) = R(S d (n)). 

We now define the directed graph Q d as follows. The vertices of Q d are the elements 
of S d . The edges of Q d consist of (a,L(a)) and (a,R(a)) where a G S d . Iterating, 
we see that L k (a) = + kj) and R k (a) — (i + kj,j), so that L d = R d = id, and 
L^ 1 = L d ~ l and R' 1 = Thus, if (a, ($) is an edge of Q d) then there is a walk of 

length d — 1 from (3 to a. 

Each vertex of Q d has out-degree two; since (L~ 1 (o;), a) and (R~ 1 (a), a) are edges, 
each vertex has in-degree two as well. Let M d = [m,,^] = [m Q| g] denote the 
adjacency matrix for Q d : M d is the N d x 0-1 matrix so that m a L^ = m a R( a ) = 1, 
with other entries equal to 0. For a positive integer r, write 

(4-4) Ml = [m<g]; 
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(r) 

then m a g is the number of walks of length r from a to (3. Finally, for 7 G Sd, and 
integers U\ < XJ%, let 

(4.5) B( T , Ut, U 2 ) = \{m:U 1 <m<U 2 k S d (m) = 7 }| 
The following is essentially equivalent to Lemma 2.3. 

Lemma 4.4. Suppose a = Sd{m), (3 G Sd and r > 1. T/ien S(/3; 2 r m, 2 r (m + 1)) = 
m o/3 * s e 9 u °^ ^° ^ e number of walks of length r in Qd from a to (3. 

Proof. The walks of length 1 starting from a are (a,L(a)) and (a, R(a)); that is, 
(Sd(n), S d {2n)) and (Sd(n), Sd(2n + 1)). The rest is an easy induction. □ 

Lemma 4.5. For sufficiently large N, m£9 > for all a, (3. 

Proof. Let «o — (0, 1) = 5^(0). Note that L(ao) = «o, hence if there is a walk of 
length w from ao to 7, then there are such walks of every length > w. By Lemma 
4.1, for each a G Sd, there exists n a so that Sd(n a ) = a. Choose r sufficiently large 
that n a < 2 r for all a. Then by Lemma 4.4, for every 7, there is a walk of length r 
from a to 7, and so there is a walk of length (d — l)r from 7 to ckq. Thus, for any 
a, /3 G iSrf, there is at least one walk of length dr from a to (3 via «o- D 

We need a version of Perron-Frobenius. Observe that = \Md is doubly sto- 
chastic and the entries of = 2~ N are positive for sufficiently large N. Thus 
Ad is irreducible (see ^3], Ch.l), so it has a simple eigenvalue of 1, and all its other 
eigenvalues are inside the unit disk. It follows that Md has a simple eigenvalue of 2. 
Let 

(4.6) f d (T) = T k + c k ^T k - 1 + • • • + c 

be the minimal polynomial of Md- Let pd < 2 be the maximum modulus of any non-2 
root of fd, and let 1 + ad be the maximum multiplicity of any such maximal root. 
Then for r > and all (a, (3), 

(4.7) m r a f + c^mlf- 1 + ■■■ + c m^ = 0. 

It follows from the standard theory of linear recurrences that for some constants 

C a (3, 

(4.8) m r ap = c aP 2 T + (r ad p d ) as r -> 00. 

In particular, lim^oo A T d = A d0 := [c a p], and since A r d +1 = A d A r d , it follows that each 
column of A d0 is an eigenvector of A d , corresponding to A = 1. Such eigenvectors 
are constant vectors and since Ado is doubly stochastic, we may conclude that for all 
(a, 0), c a p = Then there exists c d > so that for r > and all (a, (3), 



(4.9) 



or 
r * 

m, 

d 



aP ~ Nd 
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Computations show that for for small values of d at least, pd = \ and a d = 0. In any 
event, by choosing 2 > pd > pd if o~d > 0, we can replace r a d d p r d by p~ d in the upper 
bound. Putting this together, we have proved the following theorem. 

Theorem 4.6. There exist constants Cd and pd < 2 so that if m G N and a G Sd, 
then for all r > 0, 



(4.10) 



B{a;2 r m,2 r {m + 1)) - — 

Nd 



< CdPd- 



We now use this result on blocks of length 2 r to get our main theorem. 
Theorem 4.7. For fixed d>2, there exists Td < 1 so that, for all a G Sd, 

(4.11) 

Proof. By (2.25), we have 
(4.12) 



B{a;0,N) = ^ + O{N T *). 
^d 



v-l 



B(a;0,N) = ^ B(a; Nj, N j+ i) = B ( a '> ^Mj, (Mj + 1)). 
3=0 j=i 

It follows that 

N 



(4.13) 



B(a;0,N) - 



< c d (p7 + ■ ■ ■ + P71 



If p d < 1, the upper bound is 0{r x ) = C(log N) = 0(N e ) for any e > 0. If 1 < p d < 2, 



U>,1 



log 2 



since 



N < 2 ri+1 . 



the upper bound is 0(p r d 1 ) = 0(N Td ) for r d 
Using the notation (1.11), we have 

(4.14) T{N;d,i)= £(«;0,A0, 

a=(i,j)eS d 

and the following is an immediate consequence of Lemma 4.2 and Theorem 4.7. 
Corollary 4.8. Suppose d > 2. Then 

(4.15) T(N;d,i) =r d ^N + 0(N Td ), 
where, recalling that d = YIpT > 



□ 



(4.16) 



rd,i 



- X\— TT 



Pi 



1 



For example, if p is prime, then f(j>, 0) = and f(p, i) 



when p \ i. 



In some sense, the model here is a Markov Chain, if we imagine going from m to 
2m or 2m + 1 with equal probability, so that the B((3; 2 r m, 2 r (m + l))'s represent the 
distribution of destinations after r steps. Ken Stolarsky has pointed out that [T7j is a 
somewhat different application of the limiting theory of Markov Chains in a number 
theoretic setting. 



12 



BRUCE REZNICK 



5. Small values of d 



It is immediate to see (and to prove) that 2 | s(n) if and only if 3 | n, thus S^n) 
cycles among {(0, 1), (1, 1), (1, 0)} and r 2 = 0. This generalizes to a family of partition 
sequences. Suppose d > 2 is fixed, and let b(d; n) denote the number of ways that n 
can be written in the form 



n = ^e,2\ eie{0,...,d-l} 

i>0 

1. It is shown in ^H] that 

oo oo 

Y,<n)X n = X\[(l + X v + X 



(5.1) 

so that 6(2; n) 
(5.2) 

n=0 j=0 

A standard partition argument shows that 



2 j+l 



(5.3) 



E = II 

n=0 j=0 



Thus, s(n) = b(3;n — 1). An examination of the product in (5.3) modulo 2 shows 
that b(d; n) is odd if and only if n = 0, 1 mod 6? (see jT^j, Theorems 5.2 and 2.14.) 
Suppose now that d = 3. Write the 8 elements of 1S3 in lexicographic order: 

(5.4) (0, 1), (0, 2), (1, 0), (1, 1), (1, 2), (2, 0), (2, 1), (2, 2). 

Then in the notation of the last section, 



(5.5) 



M, 



/I 








1 
















1 

















1 








1 


1 


























1 





1 








1 


1 
































1 





1 


1 














1 








Vo 











1 





1 


0/ 



2)(T-n)(T-fi), 



The minimal polynomial of M 3 is 

(5.6) / 3 (T) =T 5 — 2T 4 + T 3 - AT 2 + AT = T(T - 1)(T 
where 

-l + y/7i _ -1-V7i 
(5-7) n= , fi= . 

Since the roots of f 3 are distinct, we see that for each (a, (3) G <S 3 , for r > 1, there 
exist constants t;^ so that 



(51 



rn 



(r) 
a/3 



1 



1 



Vapi + + v a p 3 fi r + -.2 r = --2 r + 0(2 
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(As it happens, there are only eight distinct sequences m£l.) Corollary 4.8 then 
implies that 

T(N;3,0) = j + O{VN), 

3/V 3/V 
T(N; 3, 1) = — + 0(VN), T(N; 3, 2) = — + 0(VN). 

Since T(N; 3, 0) + T(N; 3, 1) + T(N; 3, 2) = N, we gain complete information from 
studying T(N; 3, 0) and 

(5.10) A(N) = A 3 (N) := T(N; 3, 1) - T(N; 3, 2). 

(That is, A 3 (jV+l)-A 3 (iV) equals 0, 1, -1 when s(N) = 0, 1,2 mod 3, respectively.) 
To study T(N; 3, 0), we first define the set i 3 cN recursively by: 

(5.11) 0,5,7e4 0<n€ A 3 2n,8n±5,8n±7G A 3 . 
Thus, 

(5.12) A 3 = {0,5,7,10,14,20,28,33,35,40,45,47,49,51,56,61,63,...}. 
Theorem 5.1. If n > 0, then 3 | s(n) if and only if n G A 3 . 

Proof. It follows recursively from (1.2) or directly from (2.7) that 

(5.13) s(2n) = s(n), s(8n±5) = 2s(n)+3s(n±l), s(8n±7) = s(n)+3s(n±l). 

Thus, 3 divides s(n) if and only if 3 divides s(2n),s(8n ± 5) or s(Sn ± 7). Since 
every n > 1 can be written uniquely as 2n', 8n' ± 5 or 8n' ± 7 with < n' < n, the 
description of A 3 is complete. □ 

In the late 1970's, E. W. Dijkstra [7 (pp. 215-6, 230-232) studied the Stern se- 
quence under the name "fuse", and gave a different description of A 3 (p. 232): 

Inspired by a recent exercise of Don Knuth I tried to characterize the 
arguments n such that 3 | fusc(n). With braces used to denote zero or 
more instances of the enclosed, the vertical bar as the BNF 'or', and the 
question mark '?' to denote either a or a 1, the syntactical represen- 
tation for such an argument (in binary) is {0}1{?0{1}0|?1{0}1}?1{0}. 
I derived this by considering - as a direct derivation of my program - 
the finite state automaton that computes fuse (N) mod 3. 

Let 

(5.14) a r = \{n E A 3 : 2 r < n < 2 r+1 }| = T(2 r+1 ; 3, 0) - T(2 r ; 3, 0). 
It follows from (5.12) that 

(5.15) ao — a% — 0, a 2 = a 3 = a 4 = 2, as = 10. 
Lemma 5.2. For r > 3, (a r ) satisfies the recurrence 

(5.16) a r = a r -i + 4a r _3. 
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Proof. This is evidently true for r = 3,4,5. If 2 r < n < 2 r+1 and n = 2n', then 
2 r ~ 1 < n' < 2 r , so the even elements of A 3 counted in a r come from elements of 
A 3 counted in a r _i. If 2 r < n < 2 r+1 and n = 8n' ± 5 or n = 8n' ± 7, then 
2 r ~ 3 < n' < 2 r ~ 2 and n' e A 3 . Thus the odd elements of A 3 counted in a r come (in 
fours) from elements of A 3 counted in a r _ 3 . □ 

The characteristic polynomial of the recurrence (5.16) is T 3 — T 2 — 4 (necessarily a 
factor of fs(T)), and has roots T — 2, and /i. The details of the following routine 
computation are omitted. 

Theorem 5.3. For r > 0, we have the exact formula 



(5.17) 



1 ^TWTA / -7-5V7* 

4 I 56 r \ 56 



Keeping in mind that s(0) = is not counted in any a r , we find after a further 
computation that the error estimate 0{y/~N) is best possible for T(N; 3, 0): 

Corollary 5.4. 



(5.18) 



T(2 r ;3,0) 




7 + V7i\ r , 1 
-56~ P + 2 



To study A(iV), we first need a somewhat surprising lemma. 
Lemma 5.5. For all N, A(2N) = A (47V). 

Proof The simplest proof is by induction, and the assertion is trivial for iV 
There are eight possible "short" diatomic arrays modulo 3: 



= 0. 



(5.19) 









s(N) 
s(2N) 
s{4N) 

1 

1 1 
1 2 1 



s(N + l) 

s(2N + l) s(2N + 2) = 

s(4N + l) s{4N + 2) s(4iV + 3) s{AN + 4) 





2 
2 2 



2 
2 

1 2 



2 1 1 









1 

1 2 
1 2 







1 2 
10 2 
110 2 2 



2 
2 2 
2 12 2 



2 1 
2 1 
2 2 1 1 



2 
2 

2 



2 
2 

2 



By counting the elements in the rows mod 3 in each case, we see that A(2iV + 2) — 
A(2iV) = A(4iV + 4) - A(4iV) is equal to: 1, -1, 2, 0, 1, -2, -1, 0, respectively. □ 
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Theorem 5.6. For all n, A(n) G {0, 1,2,3}. More specifically, 



(5.20) 



s 3 




= (o,i) = 


=> A(2m) 


= 0, 


A(2m + 1) 


= 0; 


s 3 


» 


= (0,2) = 


=> A(2m) 


= 3, 


A(2m + 1) 


= 3; 


s 3 


» 


= (1,*) = 


=> A(2m) 


= 1, 


A(2m + 1) 


= 2; 


s 3 


» 


= (2,*) = 


=> A(2m) 


= 2, 


A(2m + 1) 


= 1. 



Proof. To prove the theorem, we first observe that it is correct for m < 4. We now 
assume it is true for m < 2 r and prove it for 2 r < m < 2 r+1 . There are sixteen 
cases: m can be even or odd and there are 8 choices for S 3 (m). As a representative 
example, suppose S 3 (m) = (2, 1). We shall consider the cases m = 2t and m = 2t + 1 
separately. The proofs for the other seven choices of S 3 (m) are very similar and are 
omitted. 

Suppose first that m = 2t < 2 r+1 . Then S 3 {m) = S 3 {2t) = (2,1), hence S 3 {t) = 
(2, 2). We have A(2m) = 2 by hypothesis, and hence A(4m) = 2 by Lemma 5.5. The 
eighth array in (5.19) shows that s(4t) = 2 mod 3, so that A(4m+1) = A(4m) — 1 = 1, 
as asserted in (5.20). 

If, on the other hand, m = 2t + 1 < 2 r+1 and S 3 (m) = S 3 (2t + 1) = (2, 1), then 
S 3 (t) = (1, 1). We now have A(2t) = 1 and A(2t+1) = 2 by hypothesis and A(4t) = 1 
by Lemma 5.5. The fourth array in (5.19) shows that (s(4£), s(4t + 1), s(4t + 2)) = 
(1, 0, 2) mod 3. Thus, it follows that A(2m) = A(4t + 2) = A(4t) + 1 + = 2 and 
A(2m + 1) = A(4t + 3) = A(4i + 2) - 1 = 1, again as desired. □ 

Since S 3 (m) is uniformly distributed on S 3 , (5.20) shows that A(n) takes the values 
(0, 1, 2, 3) with limiting probability (|, |, |, |). 

We conclude with a few words about the results announced at the end of the first 
section, but not proved here. For each (d, i), T(2 r ; d, i) will satisfy a recurrence whose 
characteristic equation is a factor of the minimal polynomial of Sd- It happens that 
T(2 r ;4,0) = T(2 r ;5,0) for small values of r and both satisfy the recurrence with 
characteristic polynomial T 4 — 2T 3 + T 2 — 4 (roots: 2, —1, — r, — f) so that equality 
holds for all r. The same applies to T(2 r ;6,0) = T(2 r ;9,0) = T(2 r ;ll,0), with a 
more complicated recurrence. Results similar to Lemma 5.5 and Theorem 5.6 hold 
for d = 4, with a similar proof; Antonios Hondroulis has shown that this is also true 
for d = 6. No result has been found yet for d = 5, although a Mathematica check 
for N < 2 19 shows that -5 < T(N; 5, 1) - T(iV;5,4) < 11. These topics will be 
discussed in greater detail in [T5j . 
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